# 16K long context
## ALP DeepScaleR 1.5B C16K

- **Organization:** SynthLabsAI
- **License:** Apache-2.0
- **Tags:** Large Language Model · Safetensors
- **Downloads:** 333 · **Likes:** 1

ALP_DeepScaleR_1.5B_C16K is trained on top of DeepScaleR-1.5B using the Adaptive Length Penalty (ALP) method, which significantly reduces token usage while maintaining performance.
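Since the card ships Safetensors weights, a standard transformers causal-LM load should apply. A minimal sketch, assuming the checkpoint is published under the repo id `SynthLabsAI/ALP_DeepScaleR_1.5B_C16K` (inferred from the listing, so it may differ):

```python
# Minimal sketch: loading the checkpoint with Hugging Face transformers.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "SynthLabsAI/ALP_DeepScaleR_1.5B_C16K"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # half precision keeps the 1.5B model small
    device_map="auto",
)

prompt = "Solve: what is 12 * 17?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```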
## Fathom R1 14B RS

- **Organization:** FractalAIResearch
- **License:** MIT
- **Tags:** Large Language Model · Transformers
- **Downloads:** 404 · **Likes:** 1

Fathom-R1-14B is built on the R1-distilled-14B model and reaches o4-mini-level mathematical reasoning within a 16K context, at a training cost of only $499.
## Phi 4 GGUF

- **Organization:** Mungert
- **License:** MIT
- **Tags:** Large Language Model · Supports Multiple Languages
- **Downloads:** 1,508 · **Likes:** 3

phi-4 is an open-source language model developed by Microsoft Research with a focus on high-quality data and reasoning capability; these GGUF quantizations target memory- and compute-constrained environments.
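On constrained hardware, a GGUF quant can be run locally with llama-cpp-python. A minimal sketch; the quant file name below is hypothetical, so substitute an actual file from the repo:

```python
# Minimal sketch: running a GGUF quantization with llama-cpp-python.
from llama_cpp import Llama

llm = Llama(
    model_path="phi-4-Q4_K_M.gguf",  # hypothetical quant file name
    n_ctx=16384,       # context window
    n_gpu_layers=-1,   # offload all layers to GPU if one is available
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Explain GGUF in one sentence."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```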
## Aya Vision 8B

- **Organization:** CohereLabs
- **Tags:** Image-to-Text · Transformers · Supports Multiple Languages
- **Downloads:** 29.94k · **Likes:** 282

Aya Vision 8B is an open-weights 8-billion-parameter multilingual vision-language model that supports visual and language tasks in 23 languages.
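A minimal image-to-text sketch, assuming a recent transformers release that ships `AutoModelForImageTextToText` and that the checkpoint is published as `CohereLabs/aya-vision-8b` (repo id inferred from the listing); the image URL is a placeholder:

```python
# Minimal sketch: multilingual image captioning with Aya Vision 8B.
import torch
from transformers import AutoProcessor, AutoModelForImageTextToText

model_id = "CohereLabs/aya-vision-8b"  # assumed repo id

processor = AutoProcessor.from_pretrained(model_id)
model = AutoModelForImageTextToText.from_pretrained(
    model_id, device_map="auto", torch_dtype=torch.float16
)

messages = [{
    "role": "user",
    "content": [
        {"type": "image", "url": "https://example.com/cat.jpg"},  # placeholder
        {"type": "text", "text": "Describe this image in French."},
    ],
}]

inputs = processor.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=True,
    return_dict=True, return_tensors="pt",
).to(model.device)

output = model.generate(**inputs, max_new_tokens=100)
# Decode only the newly generated tokens, not the prompt.
print(processor.tokenizer.decode(
    output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
))
```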